# Human Preference Alignment

Qwen3 32B GPTQ Int4
Apache-2.0
Qwen3 is the latest 8B parameter version of the Tongyi Qianwen series large language model, supporting thinking mode switching, multilingual processing, and tool invocation, with powerful reasoning and dialogue capabilities.
Large Language Model Transformers
Q
JunHowie
1,079
3
Qwen3 8B
Apache-2.0
Qwen3 is the latest 8B-parameter version in the Tongyi Qianwen series of large language models, supporting seamless switching between thinking and non-thinking modes with powerful reasoning, instruction following, and agent capabilities.
Large Language Model Transformers
Q
Qwen
550.09k
294
Summllama3.2 3B
Text summarization model initialized from Llama3.2-3B-Instruct, optimized through large-scale summarization feedback DPO training
Large Language Model Transformers
S
DISLab
441
36
Summllama3.1 8B
SummLlama3.1-8B is a text summarization model initialized from Llama3.1-8B-Instruct, optimized through large-scale summarization feedback via Direct Preference Optimization (DPO), excelling in fidelity, completeness, and conciseness.
Text Generation Transformers
S
DISLab
116
10
Llama 3.1 Nemotron 70B Instruct HF
A custom large language model by NVIDIA, designed to enhance the usefulness of responses generated by LLMs to user queries.
Large Language Model Transformers English
L
nvidia
29.98k
2,033
Summllama3 8B
SummLlama3-8B is a text summarization model initialized from Llama3-8B-Instruct, optimized through large-scale summarization feedback via DPO training, demonstrating excellent performance in faithfulness, completeness, and conciseness.
Text Generation
S
DISLab
15
14
Causallm 14B DPO Alpha GGUF
A 14B-parameter causal language model optimized with DPO, supporting English-Chinese text generation tasks
Large Language Model Supports Multiple Languages
C
tastypear
2,238
85
Causallm 7B DPO Alpha GGUF
A 7B-parameter large language model based on Llama 2 architecture, optimized through DPO training, supporting Chinese and English text generation
Large Language Model Supports Multiple Languages
C
tastypear
367
36
DISC MedLLM
Apache-2.0
DISC-MedLLM is a domain-specific large language model for medical dialogue scenarios developed by Fudan University's DISC Lab, built upon Baichuan-13b-base, providing high-quality health support services.
Large Language Model Transformers Chinese
D
Flmc
128
51
Eleuther Pythia6.9b Hh Sft
Apache-2.0
A causal language model based on the Pythia-6.9b foundation model, fine-tuned using Anthropic's hh-rlhf dataset for supervised training
Large Language Model Transformers English
E
lomahony
58
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase